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Abstract. Let C(n, k) denote the number of connected graphs with n labeled 
vertices and n + k — 1 edges. For any sequence (fc n ), the limit of C(n, k n ) as n 
tends to infinity is known. It has been observed that, if k n = o(y/n), this limit 
is asymptotically equal to the fc n th moment of the area under the standard 
Brownian excursion. These moments have been computed in the literature via 
independent methods. In this article we show why this is true for k n = o( ^/n) 
starting from an observation made by Joel Spencer. The elementary argument 
uses a result about strong embedding of the Uniform empirical process in the 
Brownian bridge proved by Komlos, Major, and Tusnady. 



1. Introduction 

Let C(n, k) denote the number of connected graphs with n labeled vertices and 
n + k — 1 edges. For example, C(n, 0) is the number of labeled trees on n vertices 
and is equal to n n ~ 2 by Cayley's theorem. There is a rich history of the study of the 
asymptotics of the sequence C(n, k). Wright gives the asymptotic formula when k 
is fixed and n — > oo in |14j and when k = o{ y/n) in [15] . Two different approaches 
were taken in analyzing the case when both n,k — > oo, one by Bender, Canfield, 
and McKay 0, and the other by Coja-Oghlan, Moore, and Sanwalani [3], and van 
der Hofstad and Spencer [5]. 

When k — o(y/n) it has been observed that these limits (upto scaling) are also 
given by the moments of the area under a standard Brownian excursion. A stan- 
dard Brownian excursion is a random element taking values in the subset of all 
nonnegative continuous functions on the interval [0, 1] (denoted by C[0, 1]) given 

by 

{uj e C[0, 1] : w(0) = w(l) = 0, and u(t) > for all t € (0, 1)} . 

Informally, one can describe this process as a standard Brownian motion (starting 
from zero) conditioned to return to zero for the first time at time one. We refer 
the readers to the book by Revuz and Yor for a proper introduction. Another 
description of combinatorial interest is that a standard Brownian excursion is the 
contour of the Brownian Continuum Random Tree defined by Aldous [T]. In any 
case, the area under this random continuous curve is well-defined and measurable. 
Exact and asymptotic formulas for the moments of this random area can be found 
in the article by Louchard [8] and the recent survey by Janson [6]. 
Our main result is the following. 

Theorem 1. Let A be a random variable whose law is given by the area under the 
standard Brownian excursion. Then, for all (n,k) such that k = k n = o(y / n) and 
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n — > oo we have 

lim ^ C ^ k ^ 1 

n->oo n n+3fc/2-2 £ J^k 

It can be found in [5J Page 90, eqn. (53)] that 

/ k \ k/2 

(1) EA k ~3V2k(—j , asfc^oo. 

Hence, the previous theorem reproves the result of [3] and [5j Sec 2.1] (when k = 

(2) C(n, ft) ~ V fc / 2 (e/12fc) fc/2 (3/Ttt) ft 1/2 . 

The first explanation between the connection of C(n, ft) with the area under the 
standard Brownian excursion was given in a beautiful paper by Spencer |12j . Let 
Zi,...,Z n be independent Poisson random variables with mean one. Let Yj = 
J2l=i Zi denote the partial sum process. Define the queue walk by Qo = 1, and 

(3) Qi=Q i -i + (Z i -l) = Y i -(i-l), i = l,2,...,n. 

Define the event Exc := {Qi > 0, i = 1, . . . , n — 1, and Q n = 0}. Consider the 
empirical area 

n— 1 n— 1 

M = £ (Qi - 1) = E - • 

i=l i=l 

Let £J* denote the expectation conditioned on the event Exc. Then (see [TH The- 
orem 3.2]) the following exact relationship holds 



(4) C(n, ft) = n"" 2 ^ 



Now, under proper scaling, the law of the queue walk under Exc converges in 
distribution to the law of a standard Brownian excursion. The area under the curve 
is a continuous function of the curve under the uniform distance. And hence, under 
proper scaling, which is dividing by n 3 / 2 , M converges in distribution to the area 
under the standard Brownian excursion. 

The original article by Wright [14, , had identified the limiting value of C(n, ft), 
when ft is fixed and as n tends to infinity, as n n ~ 2 n 3k ^ 2 Ck/kl, where this sequence 
of constants (cfe, k G N) came to be known as Wright's constants. 

Spencer observed, but did not prove, that if the weak convergence of M to that 
of A can be strengthened to a convergence of moments of fixed order, this would 
explain the factor of n 3k ^ 2 in Wright's expression, and provide an interpretation of 
Wright's constants as the moment sequence of A. 

The limits of all sequences C{n,k n ) have been derived in [2] by analytic com- 
binatorial methods, and in [3l |5] by probabilistic methods. For any sequence 
k n = o(y / n), they are still given by the formula ([2]). However, these methods 
neither employ nor shed any light on the Brownian approximation. 

The main result in this article argues this in the regime of k n — o(y/n). Our 
main tool is an explicit coupling that is made feasible by a strong approximation 
result due to Komlos, Major, and Tusnady (KMT). This strong approximation 
result is the Brownian bridge version of the more famous KMT embedding that 
approximates a random walk by a Brownian motion. 
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Interestingly, the proof breaks down beyond k n — o(\/n) for technical reasons 
and cannot be improved by the currently known version of the KMT result. It is 
curious that this is the same regime that Wright [15 could extend his argument. 
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2. Proof of the main result 

We will use the following notation throughout: For any event A, the random 
variable 1{A] takes the value one if A occurs, and takes the value zero otherwise. 

Theorem 2. Consider the queue walk and the event Exc from pj). Consider the 
rescaled continuous time process 

(5) X n (t) = 1— {Qi - 1) , i/(n-l)<t<(t+l)/(n-l), i = 0,...n-2. 
\/n — 1 

Also define X n (l) = 0. Then X n is a path, defined on [0, 1], that starts and ends 
at zero. Consider the empirical area 



~i , n-i 

(6) M n := / X n (t)dt = , lW2 £ (Qi ~ 1 



(n - l) 3 / 2 ^ 

Let E* denote the expectation conditioned on the event Exc. Let A be the area 
under a standard Brownian excursion. For all k = k n = o(^fn), we have 

v E* [M n f 

lim —r — = 1. 

n— >oo EA 



Let us give an outline of the steps of the proof backwards in the order in which 
they appear. 

Step 3. We prove an embedding of the partial sum process, conditioned on Exc, in 

a standard Brownian excursion and estimate errors. 
Step 2. Step 3 follows by a known KMT embedding of the empirical process (to be 

defined later) in a Brownian bridge. 
Step 1. We describe how Step 3 follows from Step 2 by re-rooting at the minimum. 

We now expand each step of the proof. 

2.1. Re-rooting at the minimum. Consider a deterministic sequence of numbers 
x := (x\, . . . , x n ). The walk with steps x is the sequence of partial sums of a; starting 
at zero. That is 

s = 0, Si = Si-i+Xi, £=1,2, ... 

For i G {1, 2, . . . , n} let x^ denote the ith cyclic shift of x, that is the sequence of 
length n whose jth term is Xi+j where i + j refers to (i + j) mod n. The following 
lemmas stem from the classical ballot problem and is used by Takacs in [T3] to 
prove Kemperman's formula. For more details and the proofs see [TUl Section 6.1]. 
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Lemma 3. Let x := (x\, . . . , x n ) be a sequence with values in {— 1, 0, 1, . . .} and 
sum — 1. Let a — min{i : Sj = minKj<„ Sj}, i.e., the first time the walk reaches 
its absolute minimum. Then the walk with steps x^ hits — 1 for the first time at 
step n. Moreover, this is the only i such that the walk with steps x^ hits — 1 for 
the first time at step n. 

We say the path S with steps a; has been re-rooted at the minimum to denote 
the path R(S) with steps x^'. The following lemma, called the discrete Vervaat's 
transform, follows from above and the exchangeability of increments [TDJ page 125]. 

Lemma 4. Suppose X = (Xi, . . . ,X n ) denote a sequence of iid random variables 
with values in {— 1, 0, 1, . . .}. Let So = 0. and Si+i = Si+Xi + \, for i = 0, 1, . . . , n— 
1. We refer to the sequence S = (So, Si, ... , S n ) as a path. Let Bridge denote the 
event S n = — 1. Then, if S is a path, conditioned on Bridge, then R(S) is distributed 
as a path conditioned on Exc = {S n = — 1, Si > 0, 1 < i < n}. 

A continuous version of this can be easily guessed [10] page 125]. Recall that the 
Brownian bridge is a standard BM conditioned to be zero at time one. 

Lemma 5. Let a be the (almost surely) unique time when a Brownian bridge B 
achieves its global minimum. Then the process given by the cyclic transformation 
{B a+t — B a , < t < 1} (understood modi,) is a standard Brownian excursion. 

To use these results for our problem we need to understand the event Bridge for 
the queue walk. Consider a unit rate Poisson point process on the half line [0, oo). 
That is, consider iid Exponential random variables {Ti, i € N} with mean one, and 
consider their partial sums So = 0, and S n = X^Li^' n — 1- We say an 'event' 
occurs at each 'time point' S n . Then, one can define Zi to be the number of events 
occurred during the interval [i — 1, i), for i > 1. This maintains that Z\,Zi, . . . are 
iid Poisson random variables with mean one. 

The event {Q n = 0} = {Y n = n — 1} is equivalent to the statement that there 
are (n — 1) events in time interval [0,n]. It is well-known, that conditioned on 
the number of events in a given interval, their points of occurrences, under the 
Poisson process, are independently and uniformly distributed. Thus, under Bridge, 
the times of occurrences of the (n — 1) events are given by (n — 1) iid Uniform(0, 1) 
random variables multiplied by n. 

Let Ui, U2, . . . , denote a countable sequence of iid Uniform(0, 1) random vari- 
ables. Consider the process 

(7) F n _i(t):=5^1{E7i<t}-ni, te[0,l]. 

Then the sequence (F„_ 1 (fc/n), k — 0,1,2, ... ,n) has the same law as (Qk — l,k = 
0, 1, 2, . . . , n) under Bridge. Note that we have suppressed the dependence of n from 
Qk- In what follows the correct n will be obvious from the context. Also define 
what is known as the empirical process 

(8) G„(t):=^( Tl{f/i<t}-t ) =^=(F n (t)+t), te[0,l}. 
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2.2. Strong embedding for the empirical process. The famous Komlos-Major- 
Tusnady (KMT) paper [7] contains a result on the strong embedding of the empiri- 
cal process in the Brownian bridge. Several variations of the result have since been 
discovered due to its importance in the empirical process theory. See [3], [5]. We 
take the statement of the following lemma from [9]. 

Lemma 6. There is a probability space on which one can define random variables 
Ui, Z7 2 > . . . that are iid Uniform (0, 1) and a sequence of processes B\, B 2 , ■ ■ ■ that 
are each distributed as a standard Brownian bridge such that if we define 

A 1 :=A 1 (n)= sup \G n (s) - B n (s)\ , 

0<s<l 

then there exist universal positive constants a, K, A such that 

(9) P(y/nA 1 > a log 7i + x) < Ke~ Xx , for all x € [0, oo). 

Consider now a path of G n and B n as defined on the probability space above. We 
now have three processes: the process X n+ i as defined in (J5|, the empirical process 
G n and the Brownian bridge B n . We know by Lemma[6]that the supremum distance 
between the processes G n and B n is Aj. 

We now estimate the supremum distance between X n+ i and G n . Note from the 
line following ^ 

1 / k \ k k + 1 

X n+1 (t) = -=F n — - , __<«<__, k = 0,1, ...,n. 
sjn \ n + 1 J n + 1 n + 1 

Subtracting, from ([8]), we get 

v n + 1 

Using d8b again, for fc/(n + 1) < t < (k + l)/(n + 1), we have 



\X n+1 (t) - G n (t)\ < -L 
\ n 



F n {t) - F n 



\X n+1 (t) - G n (t)\ < 



G n (t) — G n 



k 



1 



(n + l)\/n 

Taking supremum on both sides above and using Lemma [6] we get 
A 2 := sup \X n+1 (t) - G n (t)\ 

Q<t<l 

<^=+ sup sup \B n (t)-B n (k/(n + l))\ + 2A 1 . 

V n 0<k<n fe<(n+l)t<fc+l 

Express the Brownian bridge B n as (f3 n (t)—tf3 n (l), < t < 1), for some standard 
Brownian motion /3„ (see [TTJ page 37]). Then 

sup \B n (t)-B n (k/{n+l))\< 

fc/(n+l)<t<(fc+l)/(n + l) 

sup ^-^/(n + ijJl + JMll. 

fc/(n+l)<t<(fc+l)/(n+l) n + 1 

By the Markov property of Brownian motion we know that for each fc, the quantity 
|Z fe |/VrTTT:= sup \Pn(t)-PnW(n+l))\ 

k / (n+l)<t<(k+l) / (n+l) 

are independent and identically distributed. In fact, by the stationary increment 
property of Brownian motion and scaling, this distribution is the same as that 
of |/3 n |* = sup 0<s<1 |/3„(s)|. Moreover, by Paul Levy's Characterization Theorem 
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Figure 1. Re- rooting at the minimum 



([TTJ page 240]), we know that j3 n := sup^^ f3 n (s) and fin := supo^,,^ -/3„(s) 
are both distributed as the absolute value of a standard Normal random variable 
N. Observe that, for any positive x, 

P(\N\ >x) = P(fa >x)<P (\p n \* >x)< P{fi~ n > x)+P (§„>x)= 2P(\N\ > x). 

Thus the distribution of each (and all their moments) are comparable to that of 
the absolute value of the standard Normal. We will use this fact implicitly in the 
following argument. In any case 

an 



1 



V«+ 1 "<*< 

Hence, the supremum distance between the continuous time walk X n+ i and the 
Brownian bridge 
(10) 

( „) := sup \X n+1 {t) -B n (t)\ < A!+A 2 = -L+ J max IZ,; 1 1 1 



A, < 



2Ai. 



A 



0<*<1 



\/n+ 1 o<i< 



1 



3Ai. 



We now re-root at the minimum both the continuous time walk X n+ i and the 
Brownian bridge B n . Please see Figure [l] where the smooth curve is the sine curve 
which is approximated by a jagged path. By (10) their minimums differ by A/ n ) 
and can be attained at different times a w (for the walk) and a b (for the Brownian 
bridge). Depending on how close a w and <7& are, after re-rooting we get a walk 
excursion and a standard Brownian excursion which might be a little off. However, 
this does not affect the total area. 

To see what we mean, suppose y is a continuous curve on [0, 1] such that y(0) = 
2/(1) = and with an absolute minimum y m - m . Then the area under the curve 
y(-) := y(-) + y m ; n is equal to the area under the curve that is obtained by re- 
rooting y at its minimum. The difference between the minimums of the walk and 
the Brownian bridge is at most A( n ), which is also the uniform distance between 
the two curves. Thus the area between the two re-rooted curves differ by at most 
2A(„). We have the following lemma. 

Theorem 7. Consider the set-up in Theorem^ On the KMT space described in 
Lemma [6j it is possible to have for each n G N a copy of M n , under Exc, and 
a random variable A n> distributed according to the area of a standard Brownian 
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excursion such that if D„ = \M n — A n \, then there exists two absolute positive 
constants C\ and C2 such that for all k,n <= N we have 



[ED*] 1/k < Cl 



logn 



C 2 



Proof of Theorem^ From (10 1, the discussion above, and by using the triangle 
inequality we get 

(11) 



{ED*) 



l/k 



< 2 



E 



< 2 



< 



fn \Jn + 1 o<i<n 



E ^max|Zj| 



n+1 



i/fc 



l/k 



n+1 



E\Pn(l)\' 



l/k 



3 (EA* 



E (max I I 



V i 



i/k 



n + 1 



E\M1)\' 



l/k 



6 (EAf 



fe\V* 



Let = fimax, \Zi\. It is well-known that fj, n — 0{\/\og n). From the Gaussian 
concentration of measure we know that the probability density of |maxj \Zj\ — fj, n \ 
has a sub-Gaussian tail with variance 1. Thus, from known moments of the standard 
Normal distribution we infer that 



E ( max I Zi 



\ i 



l/k 



< 



fJ-n 



E 



max|Zi| - fx n 



l/k 



< [l n + C*Vk, 



(E\(3 n (l)\' 



l/k 



< CVfc, 



where C* is some absolute constant. 

Further, note that Ai = sup s \G n (s) — B n (s)\. Now 

\/nAi < a log n + (Ai — a log n) + , 

where x + = xl{x > 0}. Again using the triangle inequality for the fcth norm and 
the bound from we get 

l/k 



^(EAl) 1 '* < alogn 
Thus 



kK 



dx 



= a\ogn + K l / k \- l {k\) 1 ' k . 



\EA k ] 1/k < al ° gn + 0<yk ) 
Substituting in ( p~T[ ) we get our desired result. 
Proof of Theorem [2| From the last theorem we know that 



□ 



(12) 



(ED k ) 1/k < Cl 



logn 



Co 



From here one can estimate the deviation of moments. By elementary calculus, 

1 



(13) I 
Choose e such that 



(l + e) fe -l| <2ft|e|, for all |e| < 



(l + e)= \E(M n ) k ] 1/k /\E{A n f 



2k 

l/k 
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Clearly then 



E (M n y 



i/fe 



E{A n f 



l/k 



e < 



E{A ri 



l/k 



< 



E(D n y 

~E(A n ) k 



l/k 



l/k 



Note that A n , being the area under a standard Brownian excursion, has a law 
that does not depend on n. From ([I]) we claim the existence of another absolute 
constant C3 > such that for all k £ N, 

i/k 

> C 3 Vk. 



Thus 



e <0 




Thus ke converges to zero for all sequences such that k — o(-\fn). This finishes 
the proof. □ 

Proof of Theorem^ To finish the proof of Theorem[l]we simply need to argue that 
when k = o(\/n), then 



E* 



lim k\- 



1 



«^oo ' E* (M k ) 

and use Q. However, this follows from elementary bounds since M = 0(n 3 / 2 ) and 
k = o{^/n). We skip the details. 



□ 
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